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Abstract 

T-H , We study the problem of computing a preemptive schedule of equal-length jobs with given 

' release times, deadlines and weights. Our goal is to maximize the weighted throughput, which 

, is the total weight of completed jobs. In Graham's notation this problem is described as 

' {l\rj;pj=p;pmtn\'^WjUj). We provide an 0(n^)-time algorithm for this problem, improving 

. , the previous bound of 0{n^") by Baptiste |Bap99b| . 

1 Introduction 

(N ■ 

' We study the following scheduling problem. We are given a set of n jobs of the same integer 

Cf^ . length p > 1. For each job j we are also given three integer values: its release time rj, deadhne 

I dj and weight Wj > 0. Our goal is to compute a preemptive schedule that maximizes the weighted 

^ ■ throughput, which is the total weight of completed jobs. Ahernatively, this is sometimes formulated 

. as minimizing the weighted number of late jobs. In Graham's notation, this scheduling problem is 

^ I described as {l\rj;pj=p; pmtn| ^ wjUj), where Uj is a 0-1 variable indicating whether j is completed 

^ ' or not in the schedule. 

. Most of the literature on job scheduling focuses on minimizing makespan, lateness, tardiness, or 

I other objective functions that depend on the completion time of all jobs. Our work is motivated by 

^ ' applications in real-time overloaded systems, where the total workload often exceeds the capacity 

■ of the processor, and where the job deadlines are critical, in the sense that the jobs that are 

not completed by the deadline bring no benefit and may as well be removed from the schedule 
altogether. In such systems, a reasonable goal is to maximize the throughput, that is, the number 
of executed tasks. In more general situations, some jobs may be more important than other. This 
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can be modeled by assigning weights to the jobs and maximizing the weighted throughput (see, for 

example, |KS95j). 

The above problem {l\rj;pj=p;pmtn\ ^^WjUj) was studied by Baptiste |Bap99b| , who showed 
that it can be solved in polynomial time. His algorithm runs in time 0{n^^). In this paper we 
improve his result by providing an 0(n^)-time algorithm for this problem. 



1 b» =P; rj ; pmtnjX; Uj 1 1 ; pm tn| J2 U j 1 \Pi =P; I E 

0{n\ogn) |Law94| 0{it') ,La^ ©(n^logn) |Car81| 

0(n4) |ljap99a| 

2|rj;pmtn|X]J7,- 

NP-hard DLW92 

l|pj=p;»'j;pnitn| l|rj; pmtn|^WjC/j- P\pi^p; p mtn| J2 '^jUj l|pi=p ; rj\Y, Wj U] 

0(n4) [this paper] NP-hard fHTZ^ NP-hard (ijK99| 0{n') |ljap99b| 

was 0(n^°) |Bap99b| pseudo-polynomial 

|Law90| Prn |pi pmtn I ^ WjJTj Pm\pi — P]'rj\'Y^WjUj 

if ri < Tj ^ Wi > w 



0(n2'"™!) BapOOj (^(ne^+i) |BBKT02| 



O(nlogn) ' Oinmim^^d.r) [Ha^ 



Figure 1: Complexity of some related throughput maximization problems. 



Figure ^ shows some complexity results for related scheduling problems where the objective 
function is to maximize throughput. A more extensive overview can be found at Brucker and 
Knust's website jBKj . (That website, however, only categorizes problems as NP-complete, polyno- 
mial, pseudo-polynomial or open, without describing their exact time complexity.) 



2 Preliminaries 



Terminology and notation. We assume that the jobs on input are numbered 1,2, ... ,n. All 
jobs have the same integer length p > 1. Each job j is specified by a triple {rj,dj,Wj) of integers, 
where rj is the release time, dj is the deadline, and Wj > is the weight of j. Without loss of 
generality, we assume that dj > rj + p for all j and that min^ rj = 0. 

Throughout the paper, by a time unit t we mean a time interval [t,t + l), where t is an integer. 
A preemptive schedule (or, simply, a schedule) S is a function that assigns to each job j a set S{j) 
of time units when j is executed. Here, the term "preemption" refers to the fact that the time 
units in S may not be consecutive. We require that S satisfies the following two conditions: 

(schl) S{j) C [rj,dj) for each j (jobs are executed between their release times and deadlines.) 

(sch2) S{i) n S{j) = for i 7^ j (at most one job is executed at a time.) 

If t G S{j) then we say that (a unit of) j is scheduled or executed at time unit t. If \S{j)\ = p, 
then we say that S completes j. The completion time of j is Cj = 1 + max S{j). Without loss of 
generality, we will be assuming that each job j is either completed {\S{j)\ = p) or not executed at 
ah (S(j) = 0). 
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The throughput of S is the total weight of jobs that are completed in S, that is 'w{S) = 
^\S(j)\=p''^j- Onv goal is to find a schedule of all jobs with maximum throughput. 

For a set of jobs J, by w{J') = Ylj^j '^j '^^ denote the total weight of J. Given a set of jobs J, 
if there is a schedule S that completes all jobs in J', then we say that J' is feasible. The restriction 
of 5 to i7 is called a full schedule of J'. 

Earliest-deadline schedules. For two jobs j, k, we say that j is more urgent than k if dj < d^- 
It is well-known that if J is feasible, then J can be fully scheduled using the following earliest- 
deadline rule: at every time step t, execute the most urgent job among the jobs that have been 
released by time t but not yet completed. Ties can can be broken arbitrarily, but consistently, for 
example, always in favor of lower numbered jobs. If S is any schedule (of all jobs), then we say 
that S is earliest-deadline if its restriction to the set of executed jobs is earliest-deadline. 

Since any feasible set of jobs J can be fully scheduled in time O(nlogn) using the earliest- 
deadline rule, the problem of computing a schedule of maximum throughput is essentially equivalent 
to computing a maximum- weight feasible set. 
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Figure 2: Examples of earliest-deadline schedules with p = 3. The rectangles represent intervals 
[rj,dj), and the shaded areas show time units where jobs are executed. The first schedule consists 
of two distinct blocks. After removing the least urgent job 5, the second block splits into several 
smaller blocks. 



Each earliest-deadline schedule S has the following structure. The time axis is divided into busy 
intervals (when jobs are being executed) called blocks and idle intervals called gaps. Each block is 
an interval [r^, Cj) between a release time and a completion time Cj, and it satisfies the following 
two properties: (bl) all jobs executed in this block are not released before rj, and (b2) Cj is the 
first completion time after rj such that all jobs in S released before Cj are completed at or before 
Cj. Note that Cj — ri = ap, for a equal to the number of jobs executed in this block. Figure [21 
shows two examples of earliest-deadline schedules. 

In some degenerate situations, where the differences between release times are multiples of p, a 
gap can be empty, and the end Cj of one block then equals the beginning r^ of the next block. 

The above structure is recursive, in the following sense. Let k be the least urgent job scheduled 
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in a given block [ri,Cj). Then the last completed job is k. Also, when we remove job k from the 
schedule, without any further modifications, we obtain again an earliest-deadline-schedule for the 
set of remaining jobs (See Figure E)). The interval [rj,Cj) may now contain several blocks of this 
new schedule. 



2.1 An 0{n'^)-Tim.e Algorithm 

We assume that the jobs are ordered 1,2, according to non-decreasing deadlines, that is 

di < d2 < . ■ ■ < dn- Without loss of generality we may assume that job n is a "dummy" job 
with Wn = and r„ = dn-i (otherwise, we can add one such additional job). We use letters 

k, I € [1, n] for job identifiers, and a, 6 € [0, n] for numbers of jobs. 

Given an interval [x, y), define a set J' of jobs to be {k, x, y) -feasible if 

(fl) JC{1,2,...,A;}, 

(f2) rj e [x, y) for all j € J , and 

(f3) J has a full schedule in [x, y) (that is, all jobs are completed by time y.) 

An earliest-deadline schedule of a (A;, x, y)-feasible set of jobs will be called a {k^x^y)- schedule. If 
ties are broken consistently, then there is a 1-1 correspondence between feasible sets of jobs and 
their earliest-deadline schedules. Thus, for the sake of simplicity, we will use the same notation J 
for a feasible set of jobs and for its earliest-deadline schedule. 

Note that if e is the job with the earliest release time, then an optimal (n, re, rri)-schedule is 
also an optimal schedule to the whole instance. The idea of the algorithm is to compute optimal 
(A;, Tj, rj)-schedules J-f^ in bottom-up order, using dynamic programming. As there does not seem 
to be an efficient way to express J-f^ in terms of such sets for smaller instances, we use two auxiliary 
optimal schedules denoted Qf^ and Ti-^ j on which we impose some additional restrictions. 

We first define the values F^^, Gfj, and Hf^^ that are meant to represent the weights of the 
corresponding schedules mentioned above. The interpretation of these values is as follows: 

F^- = the optimal weight of a (A;, rj, rj)-schedule. 

G^^ = the optimal weight of a [k, ri,ri -|- ap)-schedule that consists of a single block 
starting at time rj and ending at rj -|- ap. 

H^- = the optimal weight of a (/c, rj, rj)-schedule that has no gap between rj and 
In Ff^- and Hf- we assume that rj < rj. In Hf- we additionally assume that k < n and 

Ti < Vk+l. 

We now give recursive formulas these values. In these formulas we use the following auxiliary 
functions: 



y-x 



1 



A(x, y) = min < n, 

I P 

X{x) = argmiUjjrj : rj > x} 
A^(x) = argmiUjjrj : rj > x} 
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Thus A(x, y) is the maximum number of jobs (but not more than n) that can be executed between 
X and y (ignoring release times and dcadhnes), such that the interval [x,y) is not completely filled. 
For X < Tni \{x) denotes the first job released at or after x. Similarly, for X < Tfi^ A^(x) is the first 
job released strictly after x. (Ties can be broken arbitrarily). 

Values F^-. If Vj = Vi then F^j = 0. Otherwise, F!^^ is defined inductively as follows: 



Ff j = max < 



l<o<n 
^ ri+ap<rj 



Note that in (Fl) X'^{ri) is well defined since rj < rj, and in (F2) A(rj + ap) is well defined since 
fi + ap < Tj . 

Values If k = or a = 0, then G^,^ = 0. If ^ [ri,ri + (a — l)p] or dk < n + ap then 

Gf „ = Otherwise, ^ is defined as follows: 



G^-'.^+Wk (G2) 



G'L = max<( 



, rk<ri<ri+ap 



Values H^j. If Tj = r-i then i/.^^ = 0. If /c = n or r^+i ^ [rj, r^] then H^j is undefined. For other 
values is defined inductively as follows: 



Algorithm DP. The algorithm first computes the values F^j, G^^, H^- bottom-up. The general 
structure of this first stage is as follows: 



for A; to n do 

for i^n downto 1 do 
for a ^ to n do 

compute G^^ 
for J <— z to n do 

compute F^- and H^, 

The values F^-, G^^, and H^^ are computed according to their recursive definitions, as given 
earlier. At each step, we record which value realized the maximum. 

In the second stage, we construct an optimal schedule where e is the job with earliest 

deadline. This is achieved by starting with and recursively reconstructing optimal schedules 
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r.+ap ?i(r.+ap) 



(Fl) 



r. 

J 



(F2) 



r. 
J 



r. 
1 



1 A(r^, r.+ap) 



r.+ap 



(G3) 



(H) 



k+1 



r+ap ?i(r+ap) 



Figure 3: Graphical explanation of the recursive formulas for Fj'j, G^^ and H^j. Shaded regions 
show blocks. In (G3), the whole schedule is one block, and darker shade shows where k is executed. 



J^ij, Gfa ^'^d T^fj- that realize weights Ff'j,Gf,^, and H^j, respectively, according to the following 
procedure. 

Computing T^j. If F^j = 0, return Tj'j = 0. If Ff'j was maximized by choice (Fl), let = 
■^A+(r ) j" maximized by choice (F2), let Tf j = Q^^^ U ^\{r-j^ap) where a is the integer 

that realizes the maximum. 

Computing G^^. If G^^ = 0, return = 0. If Gf„ is reahzed by choice (Gl), let = G^-^. 
If is realized by choice (G2), let G^a = Si,a-i U {k}. If is reahzed by choice (G3), let 
Gia = "^iT^ ^ ^lA^ri r +ap) ^ 'f^J'' '^^^^e I is the job that realizes the maximum in (G3). 

Computing "Hf^-. If Hl^j = 0, return Hfj = 0. Otherwise, H^j = Gia ^ •^\{r +ap) where a is the 
integer that realizes the maximum. 



Theorem 2.1 Algorithm DP correctly computes a maximum-weight feasible set of jobs and it runs 
in time 0{n'^). 

Proof: The time complexity is quite obvious: We have O(n^) values F^^ G^^ > ^ij^ ^'^^ they 
can be stored in 3-dimensional tables. The functions A(-,-), A"*'(-), and A(-) can be precomputed. 
Then each entry in these tables can be computed in time 0{n). The reconstruction of the schedules 
in the second part takes only time 0{n). 

To show correctness, we need to prove two claims: 
Claim 1: 

(If) w{Fij) = F!^- and is a (A;, r^, rj )-schedule. 

(Ig) w{Gig) — ^ia ^ia , + op) -Schedule that consists of a single block of a jobs 

starting at r^. 
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(Ih) w{H^j) = H^- and ■ is (A;, rj, rj)-schedule that has no gap before r^+i (assuming 
that k < n and < rk+i < rj.) 

Claim 2: 

(2f) If J' is a (A;,rj,rj)-schedule then w{J) < F^-. 

(2g) If JT" is a {k, ri,ri + ap)-schedule that is a single block of a jobs then w{J) < Gf^. 

(2h) If ^7 is a (fc, Ti, rj)-schedule that has no gap before Vk+i then w{J') < H-^j (assuming 
that k < n and < r^+i < Vj.) 

We prove both claims by induction. We first define a partial order on all function instances 
Fj'-, G^„ and Hf^-. We first order them in order of increasing k. For a fixed k, we order them in 

order of increasing length of their time intervals, that is, rj — Vi for Fj^j and H^j, and ap for G^^. 
Finally, for a fixed k, i and j, we assume that F-^j is before Hl^ j. The induction will proceed with 
respect to this ordering. 

We now prove Claim 1. The base cases are when A; = or a = in Gf „, or = rj in F^j and 
H^j. In all these cases Claim 1 holds trivially. We now examine the inductive steps. 

To prove (If), if was constructed from case (Fl), the claim holds by induction. If J^j'j was 
constructed from case (F2), let a be the integer that realizes the maximum and I = A(rj+ap). Since 
Vi + ap < ri, sets Q^^^ and J^f- are disjoint, and so arc the intervals [rj,rj + ap), [n,rj). Thus both 
and J^f - can be fully scheduled in [ri,rj) and w{J='^j) = w{g^^J + w{Tf -) = + F^- = F^^j, 
by induction. 

To prove (Ig), if is realized by case (Gl), the claim is obvious. In case (G2), we have 
^ ^ ^ia-i' fk^Ti + {a — and dk > Vi + ap. Thus we can schedule Q^~\, and then schedule k 
at ri + {a- l)p. By induction, w{gl^) = w{g'l~\) +Wk = Cf'^ + Wk = G^^. In case (G3), let / 
be the job that realizes the maximum and b = A(r;, + ap). The sets 'H'^J^ and Gi^^ are disjoint 
and so are the intervals [ri,r/), [r;,r; + bp). By the definition of b, we have ri + bp < ri + ap, so 
there is a non-zero idle time in Ti-^J^ U G^^^ between r; + bp and + ap. Since the total interval 
[rj,rj + ap) has length ap, the total idle time in this interval must be at least p. Moreover, all 
gaps occur after r^. This implies that we can schedule job k in these idle intervals. Also, note that 
w{^ta) = ^(^tr') + ^(^i,b') + ^k = Hlf' + Gl^' +Wk = „, so the claim holds. 

To prove (Ih), let a be the integer that realizes the maximum in (H) and I = X{ri+ap). As before, 
since ri+ap < ri, sets Gia and J^f- are disjoint, and so are the intervals [r^, rj+ap), [r;, rj). Thus both 
Gla and can be fully scheduled in [n,r,) and w(HI^) = w{Gl,) + w^T^^) = + F^. = H^-, 
by induction. 

We now show Claim 2. Again, we proceed by induction with respect to the ordering of the 
instances described before the proof of Claim 1. The claim holds trivially for the base cases. We 
now consider the inductive step. 

To prove (2f), we have two cases. If J' does not start at r^, then it cannot start earlier than 
at Vm, for m = A^(rj), so the claim follows by induction. If J starts at rj, let a be the length of 
its first block. The second block (if any) cannot start earlier than at ri, for I = X(ri + ap). (Note 
that there might be no gap between the blocks.) We partition J' into two sets: J7i containing the 
jobs scheduled in [ri, ri + ap) as a single block, and J^2 containing the jobs scheduled in [r;, rj). By 
induction, w{J) = w{Ji) + w{j2) < G^^ + Fj". < F^^. 
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We now prove (2g). l{k(^ J then J \s a (/c- 1, r^, + ap)-schedule, so w{J) < G^'^ < by 
induction and by case (Gl). Now assume that k £ J. If job k has not been interrupted, then J—{k} 
is a (/c — l,ri,ri + (a — l)p)-schedule. Thus, by induction and (G2), w^J) < G'^~\ + Wk< G^^. 

Otherwise, let / be the last job that interrupted k. Starting at r;, J' executes b = A{ri,ri + ap) 
jobs with deadlines smaller than dk, after which it executes a portion ri + ap — ri — bp > of job k. 
We partition J' — {k} into two sets: J'l containing the jobs scheduled before r;, and J^2 containing 
the jobs scheduled after r/. Note that JiD = <J — {k}, since the jobs scheduled before ri must 
also be completed before r/ and the other jobs cannot be released yet. By induction, sets J'l is a 
(A; — 1, Tj, r;)-schedule in which the first block starts at and ends after r^, and J2 is a single block 
starting at r; and ending at r; + bp. Thus, by induction and (03), w{J') = w{Ji) + w{J2) +Wk < 

The proof of (2h) is similar. We have two subcases. Suppose first that r^+i = r^. By induction, 
we have wlJ) < F^j = Hfp since in this case we can choose a = in (H). If r^+i > r^, then the 
first block of J starts at rj and ends after rk+i- Let a be the length of its first block. The second 
block (if any) cannot start earlier than at r;, for / = A(rj + ap). We partition J into two sets Ji 
containing the jobs scheduled in [rj, + ap) as a single block and ^2 containing the jobs scheduled 
in [ri,rj). By induction, w{J) = w{Ji) + u'( J2) < Gf_„ + F^- < H^.. □ 

3 Final Remarks 

Several open problems remain. Although our running time O(n^) for the scheduling problem 
(l|rj;pj=p; pmtn| Yl'^j^j) substantially better than the previous bound of 0(n^'^), it would be 
interesting to see whether it can be improved further. Similarly, it would be interesting to improve 
the running time for the non-preemptive version of this problem, {l\rj;pj=p\^WjUj)., which is 
currently 0{nJ) |Bap99a . 

In the multi-processor case, the weighted version is known to be NP-complete |BK99j . but 
the non-weighted version remains open. More specifically, it is not known whether the problem 
(P|rj;pj=p; pmtn| ^ C/j) can be solved in polynomial time. (One difficulty that arises for 2 or 
more processors is that we cannot restrict ourselves to earliest-deadline schedules. For example, 
an instance consisting of three jobs with feasible intervals (0,3), (0,4), and (0,5) and processing 
time p = 3 is feasible, but the earliest-deadline schedule will complete only jobs 1 and 2.) In the 
multi-processor case, one can also consider a preemptive version where jobs are not allowed to 
migrate between processors. 
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